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ABSTRACT 



A data transfer controller with hub and ports uses an 
effective channel priority processing technique and algo- 
rithm. Data transfer requests are queued in a first-in-first-out 
fashion at the data source ports. Each data transfer request 
has a priority level for execution. In effective channel 
priority processing the priority level assigned to a source 
port is the greatest priority level of any data transfer request 
in the corresponding first- in-first-out queue. This techniques 
prevents a low priority data transfer request at the output of 
a source port queue from blocking a higher priority data 
transfer request further back in the queue. Raising the 
priority of all data transfer requests within a source port 
queue enables the low priority data transfer request to 
complete enabling the high priority data transfer request to 
be reached. Thus both the low priority data transfer request 
and the high priority data transfer request in the queue of a 
single port are serviced before intermediate priority data 
transfer requests at the output of other source port queues. 

10 Claims, 6 Drawing Sheets 
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EFFECTIVE CHANNEL PRIORITY initiated packet transfers" (XPTs). The digital signal pro- 

PROCESSING FOR TRANSFER cessor initiated transfers are referred to as "packet transfers" 

CONTROLLER WITH HUB AND PORTS (PTs). The RISC processor could also submit packet transfer 

requests to the transfer controller 

This application claims priority under 35 USC §119(e) 5 The transfer controller with hub and ports introduced 

(1) of Provisional Application No. 60/169,369, filed Dec. 7, several new ideas concepts. The first was uniform pipelin- 

1999. ing. New digital signal processor devices containing a 

Tc/-Lixn/-AT nn ™ f^r* txic t kt\ 7T7 xtt*t/"vht transfer controller with hub and ports architecture have 

TECHNICAL HELD OF THE INVENTION multiple external ^ ^ of wfaich look tQ ^ 

The technical field of this invention is digital signal 10 hub. Thus peripherals and memory may be freely inter- 
processing and more particularly control of data transfers changed without affecting the hub. The second new idea is 

within a digital signal processing system. the concept of concurrent execution of transfers. That is, up 

to N transfers may occur in parallel on the multiple ports of 
BACKGROUND OF THE INVENTION the device, where N is the number of channels in the transfer 
Digital signal processing (DSP) differs significantly from 15 controller with hub and ports core. Each channel in the 
general purpose processing performed by micro-controllers transfer controller with hub and ports core is functionally 
and microprocessors. One key difference is the strict require- ^ a of registers. This set of registers tracks the current 
ment for real time data processing. For example, in a modem source and destination addresses, the word counts and other 
application, it is absolutely required that every sample be parameters for the transfer. Each channel is identical, and 
processed. Even losing a single data point might cause a 20 mus tne numbe r °f channels supported by the transfer 
digital signal processor application to foil. While processing controller with hub and ports is highly scalable, 
data samples may still take on the model of tasking and Finally the transfer controller with hub and ports includes 
block processing common to general purpose processing, a mechanism for queuing transfers up in a dedicated queue 
the actual data movement within a digital signal processor memory. The TMS320C80 transfer controller permitted 
system must adhere to the strict real-time requirements of 25 on ly was one transfer outstanding per processor at a time, 
the system. Through the queue memory provided by the transfer con- 
As a consequence, digital signal processor systems are troUer ^ bub and P 0 ^ processors may issue numerous 
highly reliant on an integrated and efficient direct memory transfer requests up to the queue memory size before stalling 
access (DMA) engine. The direct memory access controller ^ the di & iaX processor, 
is responsible for processing transfer requests from periph- SUMMARY OF THE INVENTION 
erals and the digital signal processor itself in real time. All 

data movement by the direct memory access must be ^ transfer controller with hub and ports of this inven- 

capable of occurring without central processing unit (CPU) 11011 ^ an improvement over that described in U.S. Pat. No. 

intervention in order to meet the real time requirements of „ 6,496,740 claiming priority from U.K. Patent Application 

the system. That is, because the CPU may operate in a No - 9909196.9 filed Apr. 10, 1999 entitled "TRANSFER 

software tasking model where scheduling of a task is not as CONTROLLER WITH HUB AND PORTS ARCHITEC- 

tightly controlled as the data streams require, the direct TURE". The improvement of this invention is effective 

memory access engine must sustain the burden of meeting channel priority processing. 

all real time data stream requirements in the system. ^ The effective channel priority processing of this invention 
The early direct memory access has evolved into several ^ lDe solution to the task of monitoring and arbitration of 
successive versions of centralized transfer controllers and conflicting transfers that could cause major performance 
more recently into the transfer controller with hub and ports degradation if only the simple channel priority assigned 
architecture. The transfer controller with hub and ports determined which transfer should proceed first The tech- 
architecture is described in U.S. Pat. No. 6,496,740 claiming 45 ni ! ue det ailed here acts to raise the effective channel priority 
priority from U.K. Patent Application No. 9909196.9 filed of a low Parity transfer task which is blocking the comple- 
Apr. 10, 1999 entitled "TRANSFER CONTROLLER WITH ^on of a hi g her priority transfer task. 
HUB AND PORTS ARCHITECTURE". This conflict comes about when both the lower priority 
A first transfer controller module was developed for the channel and the higher priority channel are accessing data 
TMS330C80 digital signal processor from Texas Instru- 50 from me same P 0 ^ but lower priority task was initiated 
ments. The transfer controller consolidated the direct on an earUer processor cycle and hence is at the front of the 
memory access function of a conventional controller along < J UCUC for mat P° rt - Elective channel priority processing 
with the address generation logic required for servicing promotes the first task in the queue of a given port to the 
cache and long distance data transfer, also called direct highest priority level represented by tasks on that queue, 
external access, from four digital signal processors and a 55 BRIEF DESCRIPTION OF THE DRAWINGS 
single RISC (reduced instruction set computer) processor. 

The transfer controller architecture of the TMS330C80 is and omer "P*** of this invention are illustrated in 

fundamentally different from a direct memory access in that e awin S s * m wnicn. 

only a single set of address generation and parameter FIG - 1 illustrates in a functional block diagram the basic 

registers is required. Prior direct memory access units 60 princip^ features of the transfer controller with hub and 

required multiple sets for multiple channels. The single set P 0 ^ and related factions; 

of registers, however, can be utilized by all direct memory FIG. 2 illustrates the queue manager interface to the 

access requesters. Direct memory access requests are posted transfer controller hub unit; 

to the transfer controller via set of encoded inputs at the FIG. 3 illustrates the transfer controller pipelines; 

periphery of the device. Additionally, each of the digital 65 FIG. 4 illustrates the pipeline M -stage of the transfer 

signal processors can submit requests to the transfer con- controller, which contains the effective channel priority 

trailer. The external encoded inputs are called "externally processing hardware; 
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FIG. 5 illustrates processing following only the simple 
channel priority assigned without the effective channel pri- 
ority technique; 

FIG. 6 illustrates processing with the effective channel 
priority technique acting to modify priorities. 5 

FIG. 7 illustrates in flow chart form, a sequence of 
effective priority updates in the process of servicing a series 
of transfers in queue. 



DETAILED DESCRIPTION OF PREFERRED 
EMBODIMENTS 



10 



The transfer controller with hub and ports architecture is 
optimized for efficient passage of data throughout a digital 
signal processor chip. FIG. 1 illustrates a block diagram of 15 
the principal features of the transfer controller with hub and 
ports. It consists of a system of a single hub 100 and multiple 
ports 111 through 115. 

The transfer controller with hub and ports functions in 
conjunction with a transfer request bus having a set of nodes 20 
117, which bring in transfer request packets at input 103. 
These transfer request bus nodes individually receive trans- 
fer requests packets from transfer requesters 116 which are 
processor-memory nodes or other on-chip functions which 
send and receive data. 25 

Secondly, the transfer controller uses an additional bus, 
the data transfer bus having a set of nodes 118, to read or 
write the actual data at the requester nodes 116. The data 
transfer bus carries commands, write data and read data from 
a special internal memory port 115 and returns read data to 30 
the transfer controller hub via the data router 150 at inputs 
104. 

The transfer controller has, at its front-end portion, a 
request queue manager 101 receiving transfer requests in the 
form of transfer request packets at its input 103. Request 35 
queue manager 101 prioritizes, stores and dispatches these 
as required. 

Request queue manager 101 connects within the transfer 
controller hub unit 100 to the channel request registers 120 ^ 
which receive the data transfer request packets and process 
them. In this process, request queue manager 101 first 
prioritizes the transfer request packets and assigns them to 
one of the N channel request registers 120. Each of the N 
channel request registers 120 represents a priority level. 45 

If there is no channel available for direct processing of the 
transfer request packet, it is stored in the queue manager 
memory 102. Queue manager memory 102 is preferably a 
random access memory (RAM). The transfer request packet 
is then assigned at a later time when a channel becomes 50 
available. The channel registers interface with the source 
control pipeline 130 and destination control pipeline 140 
which effectively are address calculation units for source 
(read) and destination (write)operations. 

Outputs from these pipelines are broadcast to M ports 55 
through the transfer controller ports I/O subsystem 110. I/O 
subsystem 110 includes a set of hub interface units, which 
drive the M possible external ports units. Four such external 
ports are shown in FIG. 1 as external ports 111 through 114. 
The external ports units (also referred to as application units) 50 
are clocked either at the main processor clock frequency or 
at a different external device clock frequency. The external 
device clock frequency may be lower than or higher than the 
main processor clock frequency. If a port operates at its own 
frequency, synchronization to the core clock is required. 55 

As an example of read-write operations at the ports, 
consider a read from external port node 112 followed by a 



write to external port node 114. First source control pipeline 
130 addresses port 112 for a read. The data is returned to the 
transfer controller hub through the data router 150. On a 
later cycle the destination control pipeline 140 addresses 
port 114 and writes the data at port 114. External ports as 
described here do not initiate transfer requests but merely 
participate in reads and writes requested elsewhere on the 
chip. Read and write operations involving the processor- 
memory (transfer requestors) nodes 116 are initiated as 
transfer request packets on the transfer request bus 117. The 
queue manager 101 processes these as described above. On 
a later cycle a source pipeline output (read command/ 
address) is generated which is passed at the internal memory 
port to the data transfer bus 118 in the form of a read. This 
command proceeds from one node to the next in pipeline 
fashion on the data transfer bus. When the processor node 
addressed is reached, the read request causes the processor- 
memory node to place the read data on the bus for return to 
the data router 150. On a later cycle, a destination pipeline 
output passes the corresponding write command and data to 
the internal memory port and on to the data transfer bus for 
writing at the addressed processor node. 

The channel parameter registers 105 and port parameters 
registers 106 hold all the necessary parametric data as well 
as status information for the transfer controller hub pipelines 
to process the given transfer. Both pipelines share some of 
the stored information. Other portions relate specifically to 
one pipeline or the other. 

FIG. 2 illustrates the interface of request queue manager 
101 to the transfer controller hub unit boundary and par- 
ticularly the request queue manager communications with 
the channel request registers 200, channel parameter regis- 
ters 105 and port parameters registers 106. Channel param- 
eters registers 105 and port parameters registers 106 store 
critical data regarding for example, types of transfers, mode 
information, status, and much other information critical to 
the transfer process. 

Channel request registers 200 pass information used in the 
source control pipeline 130 for generation of the read/pre - 
write commands 221. Similarly, channel request registers 
200 pass information used in the destination control pipeline 
140 for the generation of write command/write data words 
222. Read response data 104 from the ports is returned to the 
destination pipeline via the data router 150. 

FIG. 3 illustrates the possible pipelines in a transfer 
controller implementation. Table 1 shows the particular 
tasks performed during the pipeline stages in the preferred 
embodiment In specific implementations, one or more 
stages may be combined but the tasks for the individual 
pipeline stages are essentially as shown in Table 1. 



TABLE 1 


Pipeline Stage 


Fbncticm 


Q 


Interrogates state of queues within ports 


M 


Maps port ready signals to channels 


P 


Prioritize highest priority channel with 




ready ports 


AO 


First half of address update cycle 


Al 


Second half of address update cycle 


C 


Issues command to ports 



The channel request registers 200 pass information used 
in the source pipeline stages 301 to 306 for generation of the 
re ad/pre -write commands 221. Similarly, the channel 
request registers 200 pass information used in the destina- 
tion pipeline stages 311 to 315 for the generation of write 
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command/write data words 222. Read response data 104 FIG. 5 illustrates an example of effective priority pro- 

from the ports is returned to the destination pipeline via the cessing. Consider a possible scenario of two transfers in 

data router 150. queue where an arbitration of priorities can make a signifi- 

FIG. 4 illustrates the pipeline M -stage. The pipeline cant difference in processor performance. Suppose two 

M-stage and P-stage will play the most significant part in 5 channels (say port 5, with highest priority and port 0, with 

carrying out the effective channel priority processing tech- lowest priority) are using the same source port, and the data 

nique of this invention. It is helpful to consider the make-up at tbe front of the response queue is from the lower priority 

of the M-stage in further detail. The pipeline M-stage channel (0). If there was no effective priority processing, 

interfaces with the ports. These ports send response infor- SUCD as indicated in FIG. 5, then the following could happen, 

raation 421 of completed reads which need to be serviced by 10 If ot her ports in the system have data available at the front 

the transfer controller. This information is then sent to the of their response queues for any of the channels 4 through 

port response FIFO 401 in the pipeline M-stage, then 1, as is the case for ports 112 through 115 in the example 

prioritized in pipeline P-stage. The pipeline M-stage is illustrated in FIG. 5, then these other ports will get serviced 

responsible for the mapping from port to channel. Note ahead of the port with data for channel 0. 

channel related output signals 451 tfarangh 454.% ptpelioe * ^ k ^^blc as far as channel 0 is concerned. 

P-stage evaluates if a channel is ready and prioritizes the Unfortunately this means that channel 5, which is supposed 



to be high priority, is actually, getting low priority perfor- 



command. 

The pipeline M-stage maps the port information on a per mance. This is because the channel 5 data transfer request 

channel basis. This is then analyzed and prioritized in the 520, which is waiting to be serviced, is blocked behind 

pipeline P-stage to select the channel to process further. FIG. 20 channel 0 data transfer 500. The data transfers 521 through 

4 illustrates the signal information that is generated for each 524, which reside at the front of each respective source port 

channel from the received port information. The source response queue, are assigned the priority of that specific 

ready signal 451 flags when the source port for the channel channel. These are each assigned a priority ahead of channel 

has read data ready for the transfer controller to process. The 0 data transfer 500. Thus ordinarily, channel 5 data transfer 

data router unit ready signal, 453 similarly flags when data 25 request 520 would be services following service of the lower 

routing unit 150 has left over read data for the channel that priority data transfers 521 through 524. The following 

needs to be processed. The destination ready signal 454 flags remedy, which allows for dynamic adjustments to the effec- 

when the destination port for the channel has available tive priority of a channel, is implemented in the improved 

queue space (internal memory port space). This is only taken transfer controller of this invention, 
into account for internal memory port transfer. The effective 30 . 

channel signal 452 is the channel number of the channel that In effectlve channel P riorit y processing, the source ports 

should be effectively raised to the priority of the present ^wg ^ b y the currently active channels are examined on 

channel a °y c ^ c °asis to determine the highest priority channel using 

There aie three main components of the pipeline M-stage. „ ea ? ^ ^ P™P"f» o( . P ri " 

TT>e port response FIFO Wholds the response information 35 the destination pipehne all channels usmg a 

from the ports inside the transfer controller hub. The pipe- p0rt * e * en P ™ n * ltVe } ° f the . h « nest 

line M-stage also includes read queue counters 402 and pn ° nty ^ V°± P.™" 1 * 

«- . - ? M need only be revised when a channel either completes or 

effective priority processing unit 403. « ^ * r r™. , \ , 

t . , ft* t < „ starts a transfer. This means that several cycles can be taken 
A multicycle delay occurs between the time of a new ^ to recomputc ^ effective priority levels m order to avoid 
response from a port denoting data ready- and the time that multiple other operations which 
* C J^ C 5 hub ^an send a aclojowledgment tend to mtroduce ^ degradation. Note that thisapproach 
(ACK) back to the port signifying that the hub will process does not ^ tbe order m which data & extracted from a 
the data. In order to be ab e .to process a^w response every ^ ven rt ^ This is still a strict 
clock even with th^ multiple-cycle ACK delay, multiple- 45 first _ m _ first _ out sequence. It simply ensures that high prior- 
entry port response FIFO 401 is used to keep track of the last it daU does ^ become for excessive cydes 
three responses from the port that have not generated an behind low priority data. 
ACK signal. In port response FIFO 401, only the first 

response may be serviced. Because of the ACK delay, one FIG. 6 illustrates the results of effective channel priority 

more response may come from the port before it receives a 50 processing for the same example of transfers as illustrated in 

signal that the first response could not be processed, thus FIG. 5 * Note that data transfer 500 has been promoted to 

stalling the port. Port response FIFO 401 advances when the priority 5 (highest priority) because there is in that source 

pipeline M-stage and P-state respond indicating which port, P° rt response queue a true priority 5 data transfer request 

if any, will receive the ACK in the next cycle. 520 awaiting service. The other data transfers 521 through 

Both the destination pipeline and the source pipeline use 55 524 > whfcn Icside at me 60111 of eacn respective source port 

read queue counters 402. Locally, the counter is decre- response queues are now also assigned the priority of the 

mented when the port is selected for an operation in the highest priority channel transfer present in that particular 

pipeline M-stage and P-state. This counter is incremented source port response queue. In this example data transfers 

when receiving a signal sent indicating that a queue entry 521 ' 522 > and 523 are the highest priority in their respective 

was emptied. go * M)rt res P onse queues and thus no promotion takes place. 

The ports and port response FIFO 401 can only be FIG. 7 illustrates the result of effective channel priority 

serviced in order. Thus there is a need for the ability to raise processing for this particular set of example transfers in the 

the effective priority when the data belonging to a low five source port response queues. Block 700 starts with the 

priority channel is blocking a port from reading and pro- previous effective channel priority assignments shown in 

cessing the data for a higher priority channel. Tbe functional 65 detail in block 701 where simple priority is contrasted with 

block which handles this operation is a part of the pipeline effective priority. For example data transfer 500 is shown 

M-Stage effective priority processing unit 403. with simple priority "0" and effective priority level elevated 
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to "5". Transfers 522, 521, and 523 retain their simple 
priorities. The result of completing transfer 500 in block 702 
is that priorities are re-evaluated in block 703, and new 
effective priority assignments are carried out in block 704 
and are shown in detail in block 705. The flow chart shows 
subsequent transfers completed in blocks 706, 710, and 714 
for transfers 520, 522, and 528 respectively. 

Because no new additional transfers are shown to have 
been requested during this sequence, source port response 
queue 113 empties after data transfer 528 in block 714. The 
individual blocks 705, 709, 713 and 717 show how the 
effective priority adjustments could result in the higher 
priority transfers being completed despite their place in 
queue behind lower priority transfers. 

What is claimed is: 

1. A data transfer controller comprising: 
a plurality of ports; 

a plurality of first-in-first-out source port data transfer 
request queues, each of said plurality of first-in-first-out 
source port data transfer request queues corresponding 
to one of said plurality of ports and adapted for storing 
data transfer requests specifying said corresponding 
port as data source, each data transfer request specify- 
ing a data source, a data destination, a data quantity to 
be transferred and a priority level; 

a data transfer hub connected to said plurality of ports and 
to said plurality of first-in-first-out source port data 
transfer request queues, said data transfer hub receiving 
a next data transfer request from each of said first-in- 30 
first-out source port data transfer request queues, pri- 
oritizing and dispatching said data transfer requests 
between ports, said data transfer hub further controlling 
data transfer from a source port corresponding to said 
data source to a destination port corresponding to said 35 
data destination in a quantity corresponding to said data 
quantity to be transferred of a currently executing data 
transfer request in a source port priority corresponding 
to said priority level; and said data transfer hub further 
setting said source port priority of a next data transfer 40 
request from each of said first-in-first-out source port 
data transfer request queues equal to the highest prior- 
ity of any data transfer request stored in said corre- 
sponding first-in-first-out source port memory transfer 
request queue. 

2. The data transfer controller of claim 1, wherein: 
said data transfer hub setting said source port priority for 

all ports upon completion of each data transfer request. 

3. The data transfer controller of claim 1, further com- 
prising: 

an internal memory; and 

at least one of said plurality of ports is connected to said 
internal memory. 

4. The data transfer controller of claim 1, wherein: 
at least one of said plurality of ports is adapted for 

connection to an external memory. 
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5. The data transfer controller of claim 1, wherein: 

at least one of said plurality of ports is adapted for 
connection to an external peripheral. 

6. A data processing system comprising: 

a plurality of data processors, each data processor capable 
of generating data transfer requests, each data transfer 
request specifying a data source, a data destination, a 
data quantity to be transferred and a priority level; 

a plurality of ports; 

a plurality of first-in-first-out source port data transfer 
request queues, each of said plurality of first-in-first-out 
source port data transfer request queues corresponding 
to one of said plurality of ports and adapted for storing 
data transfer requests specifying said corresponding 
port as data source, each data transfer request specify- 
ing a data source a data destination, a data quantity to 
be transferred and a priority level; 

a data transfer hub connected to said plurality of data 
processors, said plurality of ports and to said plurality 
of first-in-first-out source port data transfer request 
queues, said data transfer hub receiving a next data 
transfer request from each of said first-in-first-out 
source port data transfer request queues, prioritizing 
and dispatching said data transfer requests between 
ports, said data transfer hub further controlling data 
transfer from a source port corresponding to said data 
source to a destination port corresponding to said data 
destination in a quantity corresponding to said data 
quantity to be transferred of a currently executing data 
transfer request in a source port priority corresponding 
to said priority level; and 

said data transfer hub further setting said source port 
priority of a next data transfer request from each of said 
first-in-first-out source port data transfer request queues 
equal to the highest priority of any data transfer request 
stored in said corresponding first-in-first-out source 
port memory transfer request queue. 

7. The data processing system of claim 6, wherein: 
said data transfer hub setting said source port priority for 

all ports upon completion of each data transfer request. 

8. The data processing system of claim 6, further com- 
prising: 

a plurality of internal memories, each of said plurality of 
internal memories corresponding to one of said data 
processors; 

one of said plurality of ports is connected to said plurality 
of internal memories. 

9. The data processing system of claim 6, wherein: 

at least one of said plurality of ports is adapted for 
connection to an external memory. 

10. The data processing system of claim 6, wherein: 

at least one of said plurality of ports is adapted for 
connection to an external peripheral. 



